Learning to Extract Motion from Videos in Convolutional Neural Networks
This paper shows how to extract dense optical flow from videos with a
convolutional neural network (CNN). The proposed model constitutes a potential
building block for deeper architectures to allow using motion without resorting
to an external algorithm, e.g., for recognition in videos. We derive our network
architecture from signal processing principles to provide desired invariances
to image contrast, phase and texture. We constrain weights within the network
to enforce strict rotation invariance and substantially reduce the number of
parameters to learn. We demonstrate end-to-end training on only 8 sequences of
the Middlebury dataset, orders of magnitude less than competing CNN-based
motion estimation methods, and obtain comparable performance to classical
methods on the Middlebury benchmark. Importantly, our method outputs a
distributed representation of motion that can represent multiple transparent
motions and dynamic textures. Our contributions on network design and rotation
invariance offer insights that are not specific to motion estimation.
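The weight-constraint idea can be illustrated with a minimal sketch (hypothetical, not the paper's actual architecture): a single base kernel is shared across its 90-degree rotations and the responses are pooled over orientation, so only the base kernel's parameters are learned and the pooled response map rotates consistently with the input.

```python
import numpy as np

# Hypothetical sketch: share one learned base kernel across rotated
# copies and max-pool over orientations. Only the base kernel is
# learned, and the pooled response rotates consistently with the input.
def corr2d(img, k):
    # Valid-mode 2D cross-correlation.
    kh, kw = k.shape
    out = np.empty((img.shape[0] - kh + 1, img.shape[1] - kw + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(img[i:i + kh, j:j + kw] * k)
    return out

def rotation_invariant_response(img, base_kernel):
    # Max-pool over the four 90-degree rotations of one shared kernel.
    responses = [corr2d(img, np.rot90(base_kernel, m)) for m in range(4)]
    return np.max(np.stack(responses), axis=0)

rng = np.random.default_rng(0)
img = rng.standard_normal((16, 16))
kernel = rng.standard_normal((3, 3))
resp = rotation_invariant_response(img, kernel)

# Rotating the input rotates the pooled response map accordingly.
print(np.allclose(rotation_invariant_response(np.rot90(img), kernel),
                  np.rot90(resp)))  # True
```

The parameter saving is the point of the constraint: one 3x3 kernel stands in for four orientation-specific filters.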
Accuracy of Anthropometric Measurements by a Video-based 3D Modelling Technique
The use of anthropometric measurements to understand an individual’s body shape and size is increasingly common in health assessment, product design, and biomechanical analysis. Non-contact, three-dimensional (3D) scanning, which can obtain individual human models, has been
widely used as a tool for automatic anthropometric measurement. Recently,
Alldieck et al. (2018) developed a video-based 3D modelling technique, enabling
the generation of individualised human models for virtual reality purposes. As
the technique is based on standard video images, hardware requirements are minimal, increasing the flexibility of the technique’s applications. The aim of this
study was to develop an automated method for acquiring anthropometric measurements from models generated using a video-based 3D modelling technique
and to determine the accuracy of the developed method. Each participant’s anthropometry was measured manually by accredited operators as the reference values. Sequential images for each participant were captured and used as input data
to generate personal 3D models, using the video-based 3D modelling technique.
Bespoke scripts were developed to obtain corresponding anthropometric data
from the generated 3D models. When comparing manual measurements with those
extracted using the developed method, the method showed sufficient accuracy to
be a potential alternative to anthropometry using existing commercial solutions.
However, further development, aimed at improving modelling accuracy and processing speed, is still warranted.
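A typical building block of such measurement scripts (a hypothetical sketch, not the paper's bespoke code) is extracting a girth from a 3D model: slice the point cloud at a given height and take the perimeter of the slice's 2D convex hull.

```python
import numpy as np

# Hypothetical sketch: estimate a girth measurement from a 3D point
# cloud by slicing at a given height and measuring the perimeter of
# the slice's 2D convex hull (Andrew's monotone chain).
def _cross(o, a, b):
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])

def convex_hull(points):
    pts = sorted(map(tuple, points))
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and _cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and _cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return np.array(lower[:-1] + upper[:-1])

def girth_at_height(cloud, height, band=0.005):
    # Keep points within a thin horizontal band, hull them in 2D,
    # and sum the hull's edge lengths.
    slice_pts = cloud[np.abs(cloud[:, 2] - height) < band][:, :2]
    hull = convex_hull(slice_pts)
    return np.sum(np.linalg.norm(np.roll(hull, -1, axis=0) - hull, axis=1))

# Synthetic "torso": a cylinder of radius 0.4 m, so the true
# circumference at any height is 2*pi*0.4 ≈ 2.513 m.
theta = np.linspace(0, 2 * np.pi, 200, endpoint=False)
ring = np.column_stack([0.4 * np.cos(theta), 0.4 * np.sin(theta)])
z = np.repeat(np.linspace(0.9, 1.1, 21), 200)
cloud = np.column_stack([np.tile(ring, (21, 1)), z])
print(round(girth_at_height(cloud, 1.0), 3))  # 2.513
```

On a real scan the hull slightly overestimates concave cross-sections, which is one reason tape-measure comparisons like those in the study are needed.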
Zero-Shot Task Transfer
In this work, we present a novel meta-learning algorithm,
TTNet,
that regresses model parameters for novel tasks for
which no ground truth is available (zero-shot tasks). In
order to adapt to novel zero-shot tasks, our meta-learner
learns from the model parameters of known tasks (with
ground truth) and the correlation of known tasks to zero-shot tasks. This intuition finds its foothold in cognitive science, where a subject (a human infant) can adapt to a novel
concept (depth understanding) by correlating it with old
concepts (hand movement or self-motion), without receiving explicit supervision. We evaluated our model on the
Taskonomy dataset, with four tasks as zero-shot: surface
normal, room layout, depth and camera pose estimation.
These tasks were chosen based on the data acquisition complexity and the complexity associated with the learning process using a deep network. Our proposed methodology outperforms state-of-the-art models (which use ground truth)
on each of our zero-shot tasks, showing promise on zero-shot task transfer. We also conducted extensive experiments
to study the various choices of our methodology, as well as
showed how the proposed method can also be used in transfer learning. To the best of our knowledge, this is the first
such effort on zero-shot learning in the task space.
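The core idea can be caricatured in a few lines (a drastic simplification of TTNet, with made-up numbers): estimate a zero-shot task's parameters as a correlation-weighted combination of known tasks' parameters, using no labels from the novel task.

```python
import numpy as np

# Toy illustration (not TTNet itself): regress a zero-shot task's
# parameters from known-task parameters and assumed task correlations.
rng = np.random.default_rng(0)
dim = 8

# Ground-truth weight vectors of three known linear tasks.
known_w = rng.standard_normal((3, dim))

# Assumed correlations of the known tasks with the zero-shot task.
corr = np.array([0.7, 0.2, 0.1])

# Simplest possible meta-"regression": a convex combination of the
# known tasks' parameters, weighted by correlation.
novel_w = (corr / corr.sum()) @ known_w

# If the zero-shot task really is such a mixture, the regressed
# parameters fit its never-seen data, with no novel-task supervision.
X = rng.standard_normal((100, dim))
true_w = 0.7 * known_w[0] + 0.2 * known_w[1] + 0.1 * known_w[2]
err = np.mean((X @ novel_w - X @ true_w) ** 2)
print(err < 1e-12)  # True (exact mixture by construction)
```

TTNet replaces this fixed convex combination with a learned regressor over full network parameters, but the role of task correlation is the same.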
Dynamic texture recognition using time-causal spatio-temporal scale-space filters
This work presents an evaluation of using time-causal scale-space filters as primitives for video analysis. For this purpose, we present a new family of video descriptors based on regional statistics of spatiotemporal scale-space filter responses and evaluate this approach on the problem of dynamic texture recognition. Our approach generalises a previously used method, based on joint histograms of receptive field responses, from the spatial to the spatio-temporal domain. We evaluate one member in this family, constituting a joint binary histogram, on two widely used dynamic texture databases. The experimental evaluation shows competitive performance compared to previous methods for dynamic texture recognition, especially on the more complex DynTex database. These results support the descriptive power of time-causal spatio-temporal scale-space filters as primitives for video analysis.
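The joint-binary-histogram construction can be sketched roughly as follows (with simple finite differences standing in for the paper's time-causal spatio-temporal filters): binarise K filter responses, pack them into K-bit joint codes, and histogram the codes over a region.

```python
import numpy as np

# Rough sketch of the descriptor family. The three placeholder filters
# are plain finite differences along t, y and x, standing in for
# time-causal spatio-temporal scale-space derivatives.
def joint_binary_histogram(video, filters):
    # video: (T, H, W); each filter maps the video to a response volume.
    bits = [(f(video) > 0).astype(np.int64) for f in filters]
    codes = np.zeros_like(bits[0])
    for k, b in enumerate(bits):
        codes += b << k  # pack K binary responses into a K-bit code
    hist = np.bincount(codes.ravel(), minlength=2 ** len(bits))
    return hist / hist.sum()  # normalised joint histogram

filters = [
    lambda v: np.diff(v, axis=0, prepend=v[:1]),
    lambda v: np.diff(v, axis=1, prepend=v[:, :1]),
    lambda v: np.diff(v, axis=2, prepend=v[:, :, :1]),
]
rng = np.random.default_rng(1)
video = rng.standard_normal((8, 16, 16))
h = joint_binary_histogram(video, filters)
print(h.shape, round(h.sum(), 6))  # (8,) 1.0
```

Descriptors of this form are then compared between videos (e.g. by histogram distances) to classify dynamic textures; the paper's actual filters additionally respect temporal causality, which this sketch does not model.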